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EXTREMES ON TREES 
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This paper considers the asymptotic distribution of the longest 
edge of the minimal spanning tree and nearest neighbor graph on 
Xi,..., Xjv n where Xi, X 2 ,... are i.i.d. in 5R 2 with distribution F and 
N„ is independent of the Xi and satisfies N n /n —> p 1. A new approach 
based on spatial blocking and a locally orthogonal coordinate system 
is developed to treat cases for which F has unbounded support. The 
general results are applied to a number of special cases, including 
elliptically contoured distributions, distributions with independent 
Weibull-like margins and distributions with parallel level curves. 


1. Introduction. Recall that the (Euclidean) minimal spanning tree (MST) 
on a finite set of points (Xi,X2,... ,Xjv) in K 2 is the connected graph with 
these points as vertices and with the minimum total edge length. The (Eu¬ 
clidean) nearest neighbor graph (NNG) on (Xi,X2,..., Xjv) is the graph 
on which each point X,; is connected to its nearest neighbor in the set. In 
this paper the X* are assumed to be random and we are interested in the 
asymptotic distribution of the longest edge on these graphs as N —> 00 . 

Penrose (1997, 1998) considered these problems by assuming that the 
Xi are uniformly distributed in a unit cube or symmetrically normally dis¬ 
tributed in The essential ideas are that (a) the lengths of the edges at any 
location in space depend primarily on the Xi in the vicinity of that location 
and as a result are asymptotically independent of the edges in other parts of 
the space and (b) the presence of an extremely long edge is a rare event and 
hence the likelihood of having multiple extremely long edges at any location 
is asymptotically negligible compared with the likelihood of having one such 
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edge there. Clearly (a) is also essential in proving central limit theorems for 
the total edge lengths; see Kesten and Lee (1996), Lee (1997) and Penrose 
(2000). In view of (a) and (b), the asymptotic distribution of the longest 
edge of MST or NNG can be established through a Poisson convergence 
of the number of extreme edges, which in Penrose (1997, 1998) is achieved 
through the Chen-Stein method. 

Specifically, we will consider the case where the random graphs are based 
on Xi,...,Xjv„ where Xi,X 2 ,... are i.i.d. with distribution F and N n is 
independent of the X,; with N n /n-^ 1. We are primarily interested in the 
case where F has an unbounded support (although our methodology works 
rather generally also if F has bounded support). In particular, we will focus 
on those F whose density is of the form 



where t/(x) is regularly varying in some sense and satisfies suitable regu¬ 
larity conditions. This covers many elliptically contoured distributions, and 
in particular correlated normal distributions and distributions with inde¬ 
pendent Weibull-like marginals as special cases but also large classes of 
other distributions. Poisson approximation is the key idea. However, we 
use a direct approach of spatial blocking as opposed to the Chen-Stein 
method. Computations of integrals of the type f A e~ nF ^ s ^ x ' ,r ^ dF(x), where 
S(x;r) = {y : |y — x| < r} is the sphere centered at x with radius r, are a 
key part of the solution for the problem on hand. One of the novelties of 
our approach is the introduction of a locally orthogonal coordinate system 
with respect to the level curves of U, which enables particularly effective 
handling of such integrals. 

This paper is structured as follows. Section 2 introduces the notation and 
a spatial blocking argument, as well as other preliminaries. The development 
of a locally orthogonal system is made in Section 3. The main results are 
given in Section 4. Sections 5 and 6 consider homogeneous level curves and 
parallel level curves, respectively, and most of the proofs are given in Sections 
7 and 8. 

Possible extensions of the results in this paper include (a) allowing the 
dimension d to be general, (b) allowing the distance measure to be more 
general (e.g., considering weighted edges on the graphs), (c) considering the 
/c-nearest neighbor graph in which each point is connected to its k nearest 
neighbors. The solutions for these involve additional technical details but 
probably few new important ideas. 

2. Fundamentals. For convenience of notation, denote by MST(IV) and NNG(IV), 
respectively, the MST and NNG on two-dimensional random variables Xi,..., Xjv 
for any random variable N defined on the same space as the X,;. Also let 
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Mq be the longest edge of G, where in this paper G will be either a MST 
or NNG. 

As outlined in Section 1, the graphs of interest will be based on the points 
Xi,... , Xjv„ where Xi,X 2 ,... are i.i.d. with distribution F, N n is positive, 
integer-valued with 

Njn^l, 

and N n and the Xj are independent. This assumption will hold throughout 
the paper. The random quantities whose asymptotic distributions we study 
in this paper are Mmst(jv„) and M NNG (N n )- We will first assume that N n is 
Poisson distributed with mean n, in which case the points Xi,... ,Xjv n can 
be thought of as the points of a Poisson process with intensity measure nF. 
The independent-increment property of the Poisson process offers an obvious 
advantage in proving Poisson convergence. We will later show how the result 
based on the Poisson assumption can be extended to the general class of 
point processes described here. 

Temporal blocking is a common technique for proving limit theorems for 
weakly dependent random variables. See Ibragimov and Linnik (1971) and 
Leadbetter, Lindgren and Rootzen (1983). Our first theorem, which basically 
is a Poisson convergence result for M NNG (jv„)i for the case N n ~ Poisson(n), 
is based on a spatial blocking argument. For r > 0 and measurable set A, 
define 

(1) fj,W(A,r) = n[ e~ nF ^ s ^dF{^). 

JA 

Now, let us say that points with their nearest neighbor at least r away 
are r-separate. With this terminology, Hn(A,r) is the expected number of 
r-separate points in A. Similarly, let 

^\A,r) = n 2 [ f I (r<|x „ y|<2r) e-^(^) uS ^^)dF(x)dF(y) 

JA JA 

be the expected number of pairs of r-separate points such that the distance 
between the points is larger than r and smaller than 2 r. Here and elsewhere, 

| • | denotes the Euclidean norm. 

In the theorem, for each n, A n t , 1 < i < k n , are suitably large and suitably 
separated spatial blocks. The separation has to be large enough to make the 
occurrences of r n -separate points independent from block to block. This is 
ensured by condition (a). Condition (c) says that nothing of importance 
happens on the leftover parts between the blocks. Uniform asymptotic neg¬ 
ligibility of the number of r„-separate points in the individual blocks follows 
from (d). Condition (e) prevents clustering of r n -separate points. Finally, 
condition (b) is the basic norming condition of convergence of the expected 
number of r n -separate points to the mean of the limiting Poisson distribu¬ 
tion. 
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Theorem 1. Let N n have a Poisson distribution with mean n and let 
{k n } be a sequence of positive constants tending to oo. Suppose that {r n } is 
a sequence of positive constants and for each n, A n> i ,..., A n ^ n are disjoint 
measurable sets in !Ji 2 such that: 

(a) mini<j^< fen inf (|x - y|: x 6 A n>i , y G A nJ ) > 2 r n for all large n, 

(b) lim n _ >00 fiiP (3ft 2 , r n ) = some r € (0,oo), 

(c) lim n _ >00 ((Uf=i A nti ) c , r n ) = 0, 

(d) lim n ^ 00 maxi<i< fcn {A Hti , r n ) = 0, 

(e) lim^oomaxi <i<k n [pn\A nii ,r n )/p^\A n>i ,r n )\ =0. 

Then the number of r n -separate points asymptotically has a Poisson distri¬ 
bution with mean t, and thus in particular, 

(2) -P(-^nng(jv„) < r n ) —> e r . 


Proof. For convenience write V n = {Xi,... , Xjv n }. For any set A, de¬ 
fine 


N(x, A) = inf (|x - y| : y G A \ {x}), 

namely, the distance from x to its nearest neighbor in A. We will show 
the stronger result that X) x ePn ^(A(x,R n )>r„) converges in distribution to the 
Poisson distribution with mean r, from which (2) follows at once. Let 

/4 2) (A, r) = n 2 jf jf / ( | x _ y | >r) e- nF(S(x;r)u5(y;r)) dF(x) dF( y) 

be the expected number of pairs of r-separate points in A with the points 
at least a distance r apart. It is easy to check that 


( 3 ) 

and 



(iV(x/P n )>r) 


= h { n (A r) 


( 4 ) 


Write 


E 


E 


l 


(JV(x,?„)AiV(y,P„)>r) 


= £\a,t) 


'X,y€?nni,x^y 


kn 

E I (.N(*,Vn)>r„) = E E I {N(^,Vn)>r„) + E / (A(x,R„)>r„) ■ 

x.£Vn *=i xeR„n A U:i xe'P„n(lj‘ : " 1 A n:i ) c 


Roughly speaking, in the following we will establish that the second sum is 
negligible and for the first sum that its expectation tends to r, the summands 
are infinitesimal and the probability that any summand is bigger than 1 is 
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negligible compared with that of it being bigger than 0. More precisely, we 
will show that 

E I(N(x.,Vn)>r n )’ 1 

are independent random variables for each n, 


P l I(N(x.,'Pn)>r n ) > 0 j ~^ 0) 

P ( E V(x,7M>.n) > 0) -> 0, 


vxeP„nA 


n 1 |y_l n .,2 


kn / 

E p E 


^(N(x,'P„)>r n ) > 0 ) T i 


max 


i= 1 VxG'P„nA n|i / 

I(N(x.,Vn)>r n ) > 1 ) 


0. 


( 5 ) 

( 6 ) 

( 7 ) 

( 8 ) 

(9) 

l< 2 <fc„ -P(I] xG -p n nA n!i I(N(x,V n )>r n ) > 0) 

It is straightforward to see [cf. Corollary 7.5 of Kallenberg (1983)] that (5)- 

(9) imply that ExePn -7(jv(x,p„)>r„) converges in distribution to the Poisson 
distribution with mean r. 

We now show (5)-(9). First, it is clear that Exe7»»n J 4 n , i I {N{x,V n )>rn) is 
completely determined by the set V n C\ {A n ^j) Tn where (A)s = {x: d(x, A) < 5}. 
Hence, by the independent-increment property of the Poisson process and 
condition (a) of Theorem 1, we conclude that (5) holds. To show (6)—(9), we 
first note the Bonferroni inequality: 

(10) ^(A,r n )~ ffl(A,r n )<p( J2 I {N(x,v n )>r n ) > 0) < l${A,r n ). 

VxeRnnA / 

Using the rightmost inequality with A = (Uf=i ^n,i) c and A = H n> j, respec¬ 
tively, (6) and (7) follow from conditions (c) and (d), respectively. 

Next, note that since _F(S(x; r n ) U S(y; r n )) = F(S(x; r n )) + F(S(y; r n )) 
for |x y| > 2 r n , 

^\A,r n )-^\A,r n ) = u 2 f [ / ( | x _ y | >2M e-^^)uS( y;r „)) dF(x) dF(y) 

J A J A 

< (Vn\ A ,r n )) 2 . 

Hence, by conditions (d) and (e), 

/In (A ni ,r n ) 


lim max ... 

n—>oc l<i<k n ,M) 


= 0 . 


- - n (An i,r n ) 


( 11 ) 
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By (10) and (11), conditions (b) and (c) now imply 

kn f \ fcn 



= lim (3i, r n ) = t. 

n—>oo 


Finally, since 




(9) follows from (10) and (11). □ 

As a simple example, if F is the uniform distribution on [0,1] x [0,1], 
then it is straightforward to verify (a)-(e) of Theorem 1 by taking r n = 
( logr^ogr ) 1 / 2 of Penrose (1997)] and the A Ut i to be the sets [(j — 

1 )n -1 / 4 , jn -1 / 4 — n -1 / 3 ] x [(A: — 1 )n -1 / 4 , fcn -1 / 4 — n -1 / 3 ], j,k = 1,..., [n 1 / 4 ]. 
The advantage of this approach will be more obvious for more complicated 
distributions. 

The next result shows how to generalize Theorem 1 by removing the 
Poisson assumption on N n . 

Theorem 2. Suppose that 



( 12 ) 


and for some 6 < 1, 



where F = 1 — F. Then 


P(M NNG(AT n ) <r n < M NNG (n)) + -P(M N NG(n) < fn < M N NG(iV n )) —► 0 


for any sequence of positive, integer-valued random variables N n with N n /n 1. 

Proof. First for any positive constant e n , 

P(M N NG(iV n ) < r n < -^NNG(n)) + ^(^NNG(n) <T n < M NNG (jv n )) 


< P(\N n -n\> ne n ) + £ P(N n = j ) 



+ P( M NNG(n) <r n < M NNG (j))) • 
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Pick e n to tend to 0 slowly enough so that the first term tends to 0 and so 
we only have to deal with the second term. It suffices to show that 

(14) max P(M NNG (j-) < r n < M NN G( n )) —> 0 

j£[n( 1—e„),n(l+e„)] v ' 

and 

(15) max P(M NNG ( n) < r n < M nng(j -))0. 

To show (14), suppose first that j G [n(l — e n ),n — 1]. Observe that if 
Mnng( j) < r n < Mnng( n)i then there exists i G {j + 1,..., n — 1} such that 
Ai <s<n, s ^i\ x i ~Xs\> r n . Hence 

max H(M NNG( ■) <r n < M NNG(n) ) < ne n f F n_1 (S(x; r n )) dF(x) -*• 0 

by (12). Next, let j G [n + l,n(l + e n )]. Suppose that M NNG (j) < r n < M NNG(n) 
and that the longest edge on NNG(n) initiates from so that Ai< s <n,s^fc — 
X s | > r n . Since the longest edge becomes < r n by adding points X n+ i,..., Xj 
to the graph, one of those additional points must be within a distance of r n 
from Xfc. Thus, 

P(M NNG (j) <r n < M NNG ( n )) 

< P ^for some k G {1,... , n} and i G {n + 1,..., j}, 

A |X fc - X s | > r n but |X fc - Xf\ < r n 
l<s<n,s^fc 

<n 2 e n J F(S(x;r n ))F n_1 (S(x;r n ))dF(x) -> 0 

uniformly for j G [n + 1,77.(1 + e n )] by (13). This proves (14). The proof 
of (15) is similar, where the main difference is that for j G [n(l — s n ),n — 1], 

H(M NNG(n) <r n < M nng( j)) <r?e n J F(S(x; r n ))F- J_1 (5(x; r n )) dF(x), 

which, again by (13), tends to 0 uniformly for j G [n(l — e n ),n — 1]. □ 

We note that (12) follows readily from (b) of Theorem 1 since F(S(x m , r n )) n < 
e — nF(s(x;r„)) . indeed, for the cases that we will consider, the conditions 
(12) and (13) are both naturally satisfied. Thus, once we have the asymp¬ 
totic distribution of the extreme edge length for the Poisson NNG, we can 
extend that at once to a more general class of NNGs. 

The following result gives an argument for deriving the asymptotic dis¬ 
tribution of M MST ( 7 v n ) from that of M NNG (jv n ), in light of the fact that 
Mmst (jv„) > M nng (tv„) (see proof below). 
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Theorem 3. Suppose that for any sequence of positive constants m n 
with m n /n —> 1 , we have 


(16) n j [F m ”(5'(x;r n )nS(0; |x|))—T’ mn (S , (x;r n ))]/(|x| > r n /2) dF(x) —> 0. 
Then 


P(M N NG(A n ) <r n < M M s T (a r n )) —>• 0 

for any sequence of positive, integer-valued, random variables N n with N n /n 1. 


Proof. The proof is basically the same as that of Lemma 4 of Penrose 
(1998). For completeness, the essential ideas are reproduced here. Again 
write 

P(M N NG(A n ) < r n < -^MST(iV n )) 

(17) < P(\N n /n — 1| > £ n ) 

+ ^2 = m)P(M NN G(m) < r n < M MST( - m )), 

|m/n—1|<£„ 

where e n tends to zero slowly enough so that the first term tends to 0. Now 
make Xi,... ,X m a graph by including an edge between each pair of points 
at a distance at most r. Denote the resulting graph by G r . Clearly, for a 
small enough r, G r comprises connected subgraphs, called r-clusters, which 
are disconnected with one another. Observe that 

Af M ST(m) = inf{;r: G r is connected}, 

M N NG(m) = hif {r:G r does not contain an r-cluster which is a singleton}. 

Thus, M MST(m) > M NNG(m) . Suppose M NNG(m) < r n < M MST(m) . Then it 
means that G Vn is disconnected and each r n -cluster has at least two points. 
Take an r n -cluster and let x be the vertex in the cluster which is closest 
to 0. Write = {Xi,... ,X m } \ {x}. Clearly 

I*n5(x;r„)/0 and X^ C S(x; r n ) C 5(0; |x|) = 0 

since S(x;r n ) contains points and only points belonging to the same r n - 
cluster while 5(0; |x|) \ {x} does not contain points from the same r n -cluster. 
This means that if M NNG ( m ) <r n < , then there are at least two 

points belonging to different r n -clusters having the described property, where 
one of them must have modulus bigger than r n /2. Thus, 

P(M N NG(m) < r n < ^MST(m)) 

( m 

^/(|X i |>r n /2,X^n5(X i ;r n )^0 

i =1 
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and 1% n S(Xn r n ) n 5(0; |X,|) = 0) > 1 

/ m 

<E[ > r n /2, n 5(Xj;r n ) / 0 

and 2^ n 5(Xi; r n ) n 5(0; |X 4 [) = 0) ) 


, 2—1 


= E hr I(|Xj| > r n /2,2^ n 5(X i5 r n ) n 5(0; |X,|) = 0) 

\*=i / 

- E[ jriUXi] > r n /2,I% n 5(X ? :;r n ) = 0)) 


, 2—1 
Tim—1 / 


= mJ[F m i (5(x; r n ) PI 5(0; |x|)) — F m X (5(x; r„))]/(|x| > r n /2) dF(x), 
which tends to 0 by (16). The result follows from this in view of (17). □ 


While it may not be easy to make an intuitive connection between the 
condition (16) and its conclusion in Theorem 3, (16) holds quite naturally 
for the cases that we study in this paper. 

It might also be worth noting that it is easy to find examples where 
the Poisson convergence which is used for Theorem 1 does not hold. In 
fact, this is the rule rather than the exception in the one-dimensional case, 
and for example can be seen to be the case for the one-dimensional den¬ 
sity e' x '/2. A simple two-dimensional example is then obtained by letting 
the two-dimensional distribution be concentrated on the aq-axis and have 
this density. If this distribution is mixed with, say, the standard bivariate 
normal distribution, a more genuine two-dimensional example where Pois¬ 
son convergence does not occur is obtained. This example can also be simply 
modified to have a smooth density. 


3. Time-varying, locally orthogonal coordinate system. In the remaining 
part of the paper, we will focus on densities of the form 

/(x)=e- f/ W, 

where U (x) is continuous and each level curve (U = u) := {x: U (x) = u} is a 
closed and convex curve. This guarantees that / is monotonically decreasing 
in some sense. We first introduce a “locally orthogonal” coordinate system 
which is useful for our computations, in particular for computations of the 
basic quantity /Xn\A,r) defined by (1). Let 

VfJ(x) = (lK 1,0 )(x), t/( 0,1 )(x)) 
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be the gradient of U(x) at x. This gradient is throughout assumed to be 
continuous. 

For points x for which U (x) is large, we define the transformation x —> 
(£, u ) where u = U (x) and £ is determined in the following manner. We use 
a level curve (U = w), with w specified below, as a reference. If x belongs to 
the reference curve (U = w), let £(x) be the arc length from an arbitrary but 
fixed point x Q to x, measured counterclockwise, say, on the curve ( U = w). 
In general, if x is on the level curve (U = u), where u^w we let £(x) = £(x') 
where x' lies on (U (x) = w) and x and x! are connected by a curve which 
is orthogonal to each level curve between (U = w) and (U = u), in the sense 
that the normal of any intersecting level curve at the point of intersection is 
in the same direction (modulo 7r) as the tangent on the connecting curve at 
the point. The assumption that V(x) is continuous assures that the trans¬ 
formation x —> (£, u) is one-to-one. The reason for this choice of the second 
coordinate, £, is that it gives the Jacobian for the transformation to the new 
coordinate system a relatively simple form. The important thing to keep 
in mind is that this coordinate system depends on the reference curve. In 
what follows we refer to the index n in N n as “time.” In the computations 
that follow, we see that at time n when n is large, all the action takes place 
in a neighborhood of the level curve (U = logn) (see proof of Theorem 4). 
Hence at time n it is natural to set w = logn and use (U = logn) as the 
reference curve in defining £. For this reason we shall throughout the rest of 
this paper adhere to this particular coordinate system. In doing so, here and 
elsewhere, the reference of £ to time will be suppressed (i.e., instead of £ n 
we simply write £) for convenience of notation. Also, whenever there is no 
ambiguity the notation £,x will denote the functions £(x),x(£,u) as well as 
their possible values. 

We now derive the Jacobian for the coordinate change from x to (u,£). 
Note that the unit normal vector at x on the corresponding level curve is 

VU(x) 

WW)l 

By the way in which the transformation is defined, 

dx 1 W(x) W(x) 

1 ’ du |W(x)| |VC/(x)| |VC/(x)| 2 ' 

We now derive ^§jr)- Clearly the unit tangent vector at x on the cor¬ 
responding level curve is 
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If x is such that 17(x) = logn, then £ corresponds to the actual arc length 
and so 

(19) ^ = unit tangent at x = ^ (t/ (0|1) (x), -1/ (1,0) (x)), 

in which case the Jacobian is equal to 


dx 


dx\ 8 x 2 dx,\ 8 x 2 

1 

d(£,u) 


du dt d£ du 

|Vf/(x)| 


In general, the arc length from x(£\,u) to x(£ 2 ,u) is computed as 

(20) t'^~ |x (1,0) (-1, n)| d£. 

h i 

Thus, if l changes by d£, then the arc length changes by |x( 1,0 ) (£, n)| dl and 
therefore 


<9x 


x 


( 1 , 0 ) 


(£,u)\ 


|Vt/(x)| 
Consequently the Jacobian is 

dx 


( 21 ) 


d(£,u) 


(U^ 1 \x),-U^°\x)). 

|x( 1,0 ) (£,u) | 

\VU(x(£, u))|' 


For convenience write 


(22) W,u) = \VU(x(£,u))\ 

and 


(23) A (u) = length of the level curve (U = u). 

Thus, changing variables x —> (£, u ) gives 

g -nF(S(x;r)) 

- e -nF(S( X (t,uy,r)) e -u |x (1 ’°^, n)| ^ 

Ja £{£,u) 

since dF(x ) = e~ v ^ dx, where A is the image of A under the transformation 
from x to {£, u). 
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4. Main results. In this section we formulate conditions directly in terms 
of C/(x), the negative logarithm of the density /(x) = e~ u ( x \ which lead 
to convergence of the longest edges of the nearest neighbor graph and the 
minimal spanning tree. As stated in the beginning of Section 3, we focus on 
functions U for which the gradient is continuous and the level curves are 
convex. Additional conditions on smoothness and other aspects of U will be 
stated below in Assumptions A1-A6. The conditions are rather technical. 
Some discussion of their meaning is given after the statement of Theorem 5. 

The core of the problem is to find sequences r n for which the mean number 
of r n -separate points converges, that is, which satisfy linin^oo ,r n ) = 
some r £ (0,oo). Denote by £ a = t nfi any point for which 

(25) i n := i(£ 0 , log n) = mm i{£, log n). 

We assume that there exist finite positive constants ci,C 2 and a sequence 
r] n which satisfy Assumption A5, for r n defined as 

r , = rIn - ci log rj n - log(c2 Vy/^f) 

in 

Throughout assume that r n is given by this expression. 

For convenience write log 2 = log log and log 3 = log log log. The following 
set of assumptions are needed: 


Assumption Al. For each large u, given any e > 0 there exists 5 > 0 
such that |V£/(x)/|Vf7(x)| — VC/(x , )/|VC/(x / )|[ < e for all x,x 7 on U = u 
with their distance on the curve less than 5X(u). 

Assumption A2. limsup™ < °°- 

Assumption A3. For all large u, both A (u) and inf x . u(*)=u IVU(x)| 
belong to the range [u~ p ,u p ] for some p £ (0, oo). 

Assumption A4. lim u( x )^oo ivutx^ 1 ( lp g^( x )) 2 = 0 for an y bJ > 0 
with i + j = 2. 

Assumption A5. There exist positive constants ci, c 2 and a sequence of 
constants rj n —> oo with rj n = 0( log 2 n) such that [with i(£, log n) = \ V!7(x(f, logn))|, 
as before]: 

(a) lim n _ >0O e _7?ri rfn^ 1 ^ 2 f £ X J; g° sn ^ e _ K^ ,logri ) _ ^^°’ logri ^ rTl [i(l, logn) i(£ 0 , logn)] 1 / 2 dl 


C2, 
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(b) lim* 


s Vn 


0 . 


r]n sup e K(*> logTl ) £(&»i°g ")]*•« [^(^logn) £(4,, logn)] 1 / 2 dt 


Assumption A6. lim,^^ inf x . c/ (x)> u( x /|x|, VC/(x)/|V?7(x)|) > 0. 

In the following theorems let N n be a positive integer-valued random 
variable such that N n /n-^>l. Also define 

Vn — c\ log r\ n — log cf 1 \f 7 h\ 

(27) [i n =--- and a n = l/£„, 

S n 

and denote by 

A(x) = exp{— e~ x }, — 00 < x < 00 , 

the Gumbel distribution function. 

Theorem 4. Assume that for each r > 0, Assumptions A1-A5 hold 
with r n defined by (26). Then 

(M N NG(jV n ) - Tn) I A. 

Theorem 5. Assume that for each r > 0, Assumptions A1-A6 hold with 
r n defined by (26). Then 

(MmST(JV„) — hn)/&n-+ A. 

Clearly, if the conclusions of the theorems hold, then by weak convergence 
one has some flexibility in choosing the normalization; indeed, any fj n and 
with 

fjn-Vn-^0 and i n /t,n 1 

can replace rj n and f n . 

Assumptions A1-A6 are designed to meet not only analytic but also ge¬ 
ometric considerations in the proofs, but still are quite general. Assump¬ 
tion A1 means that the normalized gradient of U on a level curve f7(x) = u, 
where u is large, will change gradually when the location changes gradually 
relative to the total length of the level curve. Assumptions A2 and A6 imply 
that the level curves are not highly asymmetrically shaped. Assumption A3 
says that both A (u) and |V(u)| are O-regularly varying functions [cf. Bing¬ 
ham, Goldie and Teugels (1987)]. Assumption A4 is a very weak technical 
condition on the smoothness of U. Assumption A5 is the most significant 
condition in this group, which is aimed at connecting the normalizations 
for the longest edges on the graphs to the second-order Taylor expansion of 
£(£, logn) in areas where £(£, log n) is close to its minimum value £(•£„, log n). 
To further illustrate what these conditions mean and how to verify them, 
we proceed to examples in the next two sections. 
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5. Homogeneous exponents. The first general example considers distri¬ 
butions with exponents which are homogeneous in the sense that /(x) = 
e -t'(x) _ e -t (x)-c w j ^] 1 v(k'x) = k a V(x), where the level curves for V are 
convex. In this case it is far more natural to use polar coordinates than the 
time-varying, locally orthogonal coordinate system in the general theory. In 
polar coordinates, x = (rcos0,rsin0); we may then write V(x) = r a g(9), so 


that 

(28) 


/(x) = e ~ u M = e - ra9{e) ~ c . 


Straightforward computations show that 


(29) 


VU(x) = r a ~ 1 A{e) 


cos 0 
sin0 


| W(x)| = V(0)| 1/2 =: r a ~ 1 h(9), 


where 


m 


'ag(O) -g'( 0 )' 
. g'{0) oig{ 6 ) _ 


and \A(9)\ = a 2 g(9) 2 + g'(9) 2 . 


Defining v c (ri) = logn — c, we have that t/(x) = logn is equivalent to 
V(x) = v c (n). Hence, writing r(0,logn) for the solution to Ufx) = r a g(9 ) + 
c = logn and x(0,logn) for the corresponding x, we have that r(0,logn) = 
v c (n) l ^ a g{9 )~ 1 /° 1 , and 

|(0,logn) = |VC/(x(0,logn))| 

(30) 

= v c {n) 1 ~ 1 / a g{9)- l+l / a h{9) =: ^(n) 1 " 1 /® k{9). 


The final ingredient needed to treat distributions of the form (28) is to note 
that = | d pp | and use 

x( 6 , logn) = r(9, logn)(cos 0, sin 9) = v c (n) 1 /“</(0) _1 / a (cos0,sin0) 


to obtain 


(31) 


dl{6) 

89 


= v c (n) 1 / a g(9)- 1 / a 


m 

ag(9) 



=: u c (n) 1//Q m(0). 


Theorem 6. Assume that the density is of the form (28) with a > 1 and 
g(9) bounded away from zero and three times continuously differentiable on 
the torus [0, 2tt\. Suppose further that k{9) assumes its minimum value at d 
distinct points 9 q,... ,0^-1- Then Assumptions A1-A6 hold with £ n /£ n —> 1 
for in = (logn) 1_1 / Q A:(0o), g n = log 2 n, a = 1 and 

d— 1 

c 2 = v / 27tA:(0o) 3/2 ^2 k ,, (9 i y 1 / 2 m(9 i ). 

i=0 
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Proof. If follows from (29) that 

VC/(x) _ A(6) 


cos 9 
sin# 


(32) |VC/(x)| |A(#)|V 2 

It is clear that Assumption A1 is equivalent to the continuity of the function 
on the right-hand side of (32), which follows from the assumptions. That 
Assumptions A2 and A3 hold are immediate consequences of (29). It may be 
seen that = r a ~( l+ ^gij(6) for i + j = 2 and suitable functions 

gij(6). Assumption A4 follows from this and (30). To show Assumption A6, 
note that by (32) we have 

x VC/(x) \ ag(8) 

R’ |VC/(x)| / = IA(#)|V 2 ’ 

which is bounded away from zero since g is. 

We now verify Assumption A5. Using in order (31), (30) and Erdelyi 
[(1956), page 37], 

I n := [ e-K (£ ’ logri) -« ( ^’ logn)]r "[e(^,logn)^(C 0 ,logn)] 1/2 dC 

Ji 

r2 7T ~ _ _ 

= / e “K( 6l ' lo g n )“?( e, 0,logn)]r rl |-||'^^ l o g n )^(0 O) l o g n ^ 1 / 2 t , c ( n ) 1 /" m (0) d0 

Jo 

= v c (n) / 27r e - r " ,,c ^ 1_1/ “[ fc ^- fc ( eo )][fe(#)7’(#o)] 1/2 m(#)d# 

Jo 

< 2-1 


’ v c (n)k(d 0 )V2TT^2{r n v c (n) 1 1/a k"(9i)) l/2 m(0i). 


2=0 


Clearly v c (n) ~logn and by (25), (26) and (30), 

rnVcin ) 1 - 1 / 01 ^ 0 ) = r n ^(C 0 ,logn) ~ rj n . 

Hence 


< 2-1 


I n ~ V2TT{logn)k(6 0 ) 3/2 r] n 1/2 53(fc"(0i)) 1/2 m(8i 


2=0 


and Assumption A5(a) follows. Similar considerations prove Assumption A5(b). 

□ 


We next apply the result to classes of elliptically contoured distributions 
and to Weibull-type distributions. 

Example 1 . Consider elliptically contoured distributions with log den¬ 
sity 


C/(x) = r a ((cos#) 2 — 2 / ocos#sin# + (sin#) 2 )/d + c. 
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In particular, this includes the bivariate normal with standardized marginals 
and correlation p £ (—1,1), but p ^ 0, which is obtained for a = 2, C 2 = 
— log(27r(l — p 2 ) 1 / 2 ), and d = 2(1 — p 2 ). These distributions are of the form (28) 
with 

g(9) = ((cos0) 2 — 2/0 003 081110 + (sin0) 2 )/d = (1 — 2pcos0sin0)/d, 
and hence with 

k(0) = (1 — 2/Ocos0sin0) -1+1 /“ 

x yja 2 (l — 2pcos0sin0) 2 + (2 p — 4/0cos 2 0) 2 /d 1//a . 

The conditions of Theorem 6 are satisfied, and hence the results of The¬ 
orems 4 and 5 hold with 

/ooi _ log 2 n - log 3 n - log(c 2 ) , __1_ 

(logn) 1_1 /“fc(0o) a Un (logn) 1_1 /“fc(0o) 

Computation of fc(0o) and c 2 requires computer algebra, with a slight sim¬ 
plification obtained by noting that the constant c 2 is independent of the 
value of d. We consider the normal case, which has a = 2 and d = 2(1 — p 2 ), 
and present the results for a few values of p in Table 1. By symmetry, the 
values for —p are the same as for p. 

Example 2. In this example we consider independent marginals with 
Weibull-type densities const x e _ l :E:l l“ _ l X2 l a . The log density then is 

t/(x) = r“(| cos 01 01 + | sin0| Q ) + c 

and is of the form (28) with 

g{6) = (| cos 0|“ + | sin0| Q ). 

Hence, 

k(6) = a(| cos0| a + | sh^l 0 ) 1 " 1 /" 
x {(| cos0| Q + | sin0| Q ) 2 

x (—sign (cos 0) sin0| cos0|“ _1 + sign(sin0) cos0| sin0| a " 1 ) 2 } 1,/2 . 
To assure three times differentiability we assume that a > 4. 

Table 1 


p 

0.1 

0.3 

0.5 

0.7 

0.9 

k(9 o) 

1.348837 

1.240718 

1.153867 

1.084742 

1.026251 

C2 

19.2383720 

7.9460116 

4.0933240 

1.9501568 

0.5414317 
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Table 2 


a 5 6 7 8 

k(8 0 ) 0.769 0.472 0.280 0.162 

c 2 0.631 0.307 0.151 0.072 


Again the conditions of Theorem 6 are satisfied, and hence the results of 
Theorems 4 and 5 hold with /i n ,a n given by (33) in the previous example. 
A few examples of values of k(6o), C 2 , obtained with Maple, are given in 
Table 2. 

It might be noted that even if the result requires a > 4, it can be shown 
to hold also for 2 < a < 4. 

6. Parallel level curves. Consider the situation where the level curves 
are parallel, namely they are given by 

(34) {x: U(x) = u} = {c(t) + w(u)n(t): t £ [0, L)}, u> some u 0 > 0, 

where L is a finite positive constant, c (t) is a closed, strictly convex curve pa¬ 
rameterized clockwise by length, uj(u) is an increasing function with u>(u 0 ) = 0 
and u;(oo) = oo, and n(t) is the unit normal of c at the parameter t. Assume 
that uj(u) is differentiable and c (t) is twice continuously differentiable with 
|c(t)| e(0,oo), where refers to differentiation with respect to t. Since 
|c(t)| = 1, it is easy to see that 

(35) n(f) = |c(f)|c(t). 

Also it is clear that 

(36) ${e,u) = |VC/(x(^,u))| = for all t, 

u'(u) 

so that the asymptotic results here are different in nature from what was 
considered in Theorem 6. Since £(£, u) does not depend on £ we will denote 
it by £(u) henceforth. Define t(£) to be the inverse function of 

£(t) = arc length of the level curve (u = log n) 

from c(0) + a;(logn)n(0) to c(t) + w(logn)n(i) 

= / |c(u) +o;(logn)h(u)| dv = / [1 + w(logn)|c(u)|] dv, 

Jo Jo 

where the rightmost equality follows from (35) in conjunction with |c(u)| = 1. 
Hence, 


x (1, °) (£,u) = [c(t(£)) + u(u)h(t(£))]t'(£) 
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(37) 

= ii+u(u)\cmwmm 


1 + u(u)\c(t(£))\ 
l + w(logn)|c(f(f))| 

tm). 

Also 




A (u) = f [1 + Ld{u) c(t) ] dt 

J 0 


(38) 

rL 

f .L 


= L + uj{u) / c(t) \dt^ui{u) 

Jo 

/ \m 

1 0 


Theorem 7. Assume that (34) holds, where c(t) is twice continuously 
differentiable, and uj(u) = u a exp(fdy ) for some a > 0, a(y) —> 0 and 
ya'(y ) —> 0 as y —> oo. Then Assumptions A1-A6 hold with rj n = log[£(logn) x 
A(logro)], ci = 1/2 and C 2 = 1. 


Proof. Since Vf7(x)/|Vt/(x)| = n(f), it is not difficult to see that As¬ 
sumption A1 follows from the continuity of n(f). Both Assumptions A2 and A3 
hold trivially in view of (36) and the assumption on ui. We next verify As¬ 
sumption A5. By (36) again, 

pA(logn) 

e-^r 1/2 / e- [ ^’ logn) -^°’ log?l) ]^[e(^logn)e(4,logn)] 1/2 ^ 

Ji =o 


= e 1 / 2 A(logn)^(logn). 


Hence, Assumption A5(a) is satisfied for the choice of constants in this the¬ 
orem. The verification of Assumption A5(b) is entirely similar and therefore 
omitted. To verify Assumption A6, note that 


x W(x) 


c(t) + w(u)n(t) 


^|,n(t))~|n(t)| 2 = l 


as u —► oo. 


|x| ’ |Vt/(x)| / \ |c(t) + ui(u)i 

Hence Assumption A6 holds. 

We finally verify Assumption A4. Solving -§^) and (^-, -J^) in the 
following in terms of x\, X 2 - 

dxi dt . du 

1 = — = (ci+wrn)--hw m-—, 

OX i ox i ox i 

dx\ dt , du 

0 = -— = (ci +Loni)~ - \-uni -—, 

OX 2 OX 2 OX 2 

. 5x2 ,. . s dt . du 

0 = = (c 2 +Um 2 )--1WJ127-, 

axi axi axi 


dx 2 
dx 2 


. dt . du 

(c 2 +u;n 2 )-- hwn 2 -—, 

dx2 dx2 
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we get 


i du\ 1 / dt dt \ 1 

=— (-C 2 ,ci) and —— — (c 1 ,c 2 ). 
i 5x2/ ui' \dx i 0X2/ l + o;c 


5m 
dx 


Hence 


5 2 m _ cm®C2(5m/5xi) — u)'c2(dt/dxi) _ uj^c 2 c\C2 

dx 2 (u/) 2 (u/) 3 a/(l+u;|c|) ‘ 

5 2 m _ u/ 2 )ci(5m/5x2) — oj'c\(dt/dx2) _ o/ 2 )c 2 C 1 C 2 

5x| (a/) 2 (n/) 3 n/(l + w|c|) ’ 


5 2 


M 


uj^ci(du/dxi) — uj' c\(dt / dx\) uj^c\C 2 cici 


dx± 8 x 2 


(cm') 


02 


(cm') 3 o/(1 + w|c|) 1 


To show Assumption A4, by the computations above and the fact that 
|c| and |cj are bounded, it suffices to show that 


lim 


(39) 

The 

varying and in fact 


M 2) (m)| 


+ 


1 


(u/(u) log m) 2 = 0. 


u^oo \ [u)'(m)] 3 U){u)uj'{u) , 

The assumption of the theorem implies that both oj and a/ are regularly 


where 


J(u) = {ot + a{y 0 ))u a 1 exp(^J 

ya'iy) 


a(y) = a{y) + 


a + a(y) 


0 as y —> 00 . 


It is easily seen that 

u/(n) a + a(u ) 
uj{u) u o/(u) 

from which (39) is straightforward. □ 


and 


u^ 2 \u) a—l + a(u) 


u 


Example 3. An example of such a distribution is the bivariate normal 
distribution with independent standard normal marginals, in which case we 
can take c(t) =n (t) = (cos t, sin t), t £ [0, 2 n), and 

oj(u) = \l 2 [m — log(27r)] — 1, m > log(27r) + 1/2. 

The conditions of Theorem 7 are satisfied and so Assumptions A1-A6 hold 
with 


in = (21ogn) 1/2 , 


r] n = log47r + log 2 n, 


ci = 1/2, 


c 2 = 1. 
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Hence the results of Theorems 4 and 5 hold with 

_ log 2 ra- (1/2) log 3 n + log(2-y/27r) _ 1 

(2 log n) 1 / 2 a <T " (2 logn) 1 / 2 

This is consistent with the normalization obtained in Penrose (1998). 

7. Technical lemmas. In this section we present three technical lemmas 
which are essential for Theorems 4 and 5. No significant continuity will be 
lost if a reader postpones the details in the proofs of these lemmas during 
the first reading. 

The first one, Lemma 8, deduces three consequences of Assumption A4 
which are more directly usable in the proofs of the theorems. Lemma 10 
takes one step toward evaluating integrals like (1) and the last one, Lemma 9, 
approximates F(S(x] r n )), and in particular shows that appropriate sectors 
of S(x;r n ) may be neglected asymptotically. 

In addition to previous notation, we will use in the proofs 

(40) u n _ 6 = logn-&log 2 ra, u U)b = logn + Mog 2 n, b > 0. 


Lemma 8. Assume that Assumption A4 holds, namely lim[/( x )_ >0O (|17^’ J ) ( x )|/|VC/(x)| 2 )(logC/(x)) 2 = 
0 for any i, j > 0 with i + j = 2, and consider the coordinate system based 
on the level curve U{x) = logn. Then, for any finite constant b > 0, 


(A4a) 


lim 

n—>oc x: 


sup 

|G(x)-logn|<ftlog 2 n 


|[/(^')(x)| 

|W(x)| 2 


(log 2 n) 2 = 0 


for any i,j with i + j = 2. 


(A4b) 


lim sup 

n ~ >0 ° h-logn|< 61 og 2 n 


W,v) 

£,(l,\ogn) 


- 1 


log 2 n = 0. 


(A4c) 


lim sup | |x^ 1,0 ^(£,u)| — 1| = 0. 

n —*°° Id— logn|<felog 2 n 


Proof. Equation (A4a) is an immediate consequence of Assumption A4 
since log U (x)/ log 2 n —> 1 in the indicated range. 

The proof of parts (A4b) and (A4c) uses the relation (18), that ^ = 
VL r (x)/|VC/(x)| 2 , in two ways. First, this relation implies that 


(41) 


dxi(£,v) 


dv 


< 


1 


|W(x)| 


and 


dx 2 {£,v) 


dv 


< 


1 


|VC/(x)| 


and second, it is equivalent to 

(42) x(t?,w) — x(£, log n) = / 

J lc 


VU(x(£,s)) 


logn |W(x(£,s))| : 


ds. 
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Let 


\U^\x{£,v))\ 

M n ~ Slip |yjTTf (n 12 ’ 

\v— logn|<61og 2 n,i+j =2 I^ V))\ 

so that M n (log 2 n) 2 —> 0 by (A4a). Now, to prove (A4b), note that, by 
straightforward differentiation, 




d\\7U(x(£,v))\ 

dv 


dv 


= 



l£ + £|W(*fc.))|£ 

\{VU, (U^°\U^))(dxi/dv) + (VC/, (U^ 1 \U^))(dx 2 /dv)\ 

- iwj • 

Using (41), for \v — logn| < 61og 2 n we hence have that | c)los ^’ v ' > | < 4 M n . 
Thus, by integration, 

(e -4M„Mog 2 n_ 1) lQg n < J&V) _ x J „ < (e 4M„Mog 2 n _ 1} log „ 

4(t,log n) 

and (A4b) follows, since M n (log 2 n) 2 —> 0. 

Next, to prove (A4c), we note that similar calculations as for (43) give 
that 


d VU(x(£,v)) 


<5M n |x (1 ’ 0) (^,u)|. 


d£ | VC/ (x(£, u))| 2 

Hence, interchanging differentiation and integration in (42), we obtain 
\x^(£,v)-x^(£,logn)\<5M n T jx^(£,s)jds. 

•/log n 

By (19), |x (1, °)(^,logn)| = 1, and hence 

\\x^°\£,v)\-l\<5M n f 3 (||x( 1 ’°)(£,s)|-l| + l)ds. 

•/log n 

It then follows from a Gronvall inequality that for \v — logn| < & log 2 n, 
||x (1,0 ' ) (-£,u)| — 1| < 5M n 61og 2 nexp{5M n 61og 2 n} —► 0, 
and (A4c) follows, since M n (log 2 n) —> 0. □ 

Lemma 9. Suppose that U is differentiable with gradient VC/ and let, 
for some x, 


(43) 


e= sup | C/(y) — U (x) — (VU (x), y — x) |. 

yGS(x;r) 
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Let Q £ (—1,1]. Then, with £ = |VC/(x)|, there exist constants 6 \ = 9\(x,r) G 
[—1,1] and \ < 62 = d 2 (x, r) < 0 and sncd that 

F(5(x; r) n {y : (y - x, VC/(x)/| VC/(x)|) < O"}) 

= (27rr) 1 / 2 e- c/ Wr 3/2 e er e 01£ [l + 0 2 /(fr)]. 

Proof. Let -B = {y : (y — x, VC/(x)/|VC/(x)|) < £r}. By (43), we can 
write 

(44) F(S(x;r)nB) = e- [r(x) e Sl£ / e -<vtt(x), y -x> dy> 

is(x;r)ne 

Then change variables with y — x = Av where 


A = 


6 a 
—a b 


with (a, 6) / = VC/(x)/|VC/(x)|. It is easy to see that 
f p -(Vt/(x),y-x) 


/S(x;r)nB 

/ 

' S(0;r) 


[ I{v 2 < C r ) e dv 

J S(0',r) 

£~ 2 / I{v 2 < CC r )e~ V2 dv 

./S(0;£r) 

/ £Cr 

e~ v [{^r) 2 — n 2 ] 1/2 dn. 

-Sr 


I 

Next, letting z = n + £r, the previous expression is equal to 
r£(i+CP 


/o 


2C 2 e ir 

= 2C 2 e^ r {2ir 


e~ z {2£rz-z 2 ) 1/2 dz 
£(1+0 


e~ z z 1//2 (l — z/(2£r)) 1/z dz. 


V2 


Since 1 — x/2 < (1 — a:) 1 / 2 < 1 for x G [0,1], we have 

/■O+O i /-£(i+0 , 

/ e~ z z 1 / 2 dz-— e~ z z 3 ' 2 dz 

Jo 47o 

/■£(i+0 


< 


4£r 

e~ z z 1/2 (l- z/{2£r)) 1/2 dz 
JO 
roo 

< / e~ z z 1 ^ 2 dz = r(3/2). 

do 
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The lower bound is 
r (3/2) - 


/ e 4 1 / 2 dz - —- / e 4 3//2 dz, 

/(i+C)r 4 £r Jo 


and since T(3/2) = 7r 1 / 2 /2, we obtain that 


/ e 4 1/2 dz - — j 
/£(i+C)r 4 £r Jo 


1 /^(l+Or 


"4 3 / 2 dz 


< 


[ i{1+0r e -z z i/2(I _ z /{2^r)) l/2 dz - 7T 1/2 /2 < 0. 
Jo 


Now, 


e~ z z 1 ^ 2 dz <_1_ /' 

•Wir - m+c)r)Je 


e z/2 z 3/2 dz< _ 1 T(5/2) 


and 


(?(1 + C)r) J?(l+C)r ?(1 + C)^ 

1 /•£(i+C)r 1 

— / e~ z z 3 / 2 dz<—T(5/2). 

4£r Jo 4£r 

Since T(5/2) = 37r 1 / 2 /4 and 37r 1//2 /8 < 1, this concludes the proof. □ 


Lemma 10. Assume that Assumptions A2-A5 hold. Let constants k £ 
[0,oo) and 5 n £ [d, d] C (0,oo). T/ien /or any sufficiently large fixed b £ 
(0, oo), 

(45) n f J(£J(x) 0 K,_ 6 , n n , 6 ])e- 5 ^ F ( S ( x ^™)) [nF(S(x; r n ))] fc dF(x) - 0, 

J-x. 

and uniformly for 4.1 < 4,2 £ [0, A(logn)], 


n J I(U (x) £ [rx n _ 6 ,u nj6 ],J(x) £ [ 4 ,1,4,2]) 

(46) x e~ Sn nF(S( X ;r„)) [ nF (s(x; r n ))] k dF(x) 

_! //”£ x e -K(Llogn)-e(4,logn)]r n ^l/2^ 5 log „) M 

jA(Jogn) e _ [? (^ io gn )_^( £oi io gn )] r . n ^ i / 2 (^^og re )^ 

it follows from (45) and (46) that 

(47) n f e -^4S(x;r n )) [ nF ( 5 ( x; rn ))]* JF(x) ~ d^r. 

J X 


Proof. In this proof we will assume for convenience of notation that 
k = 0 and 5 n = 1, since the extension to the general case is straightforward. 
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First note that by Assumption A2, (A4b) of Lemma 8 and the definition 
of r n , 


(48) 


sup \£(£,u) - £(£,logn)\r r 

u£. \lL n ^_b^n,b] 


< sup 

fan, — b 5 ^n,b] ^ 


_ 1 

£{£,\ogn) 


0( log 2 n) -> 0 


and 


inf £(£,u)r n > ^(£ 0 ,\ogn)r n -> oo, 6 > 0 . 

[^n, — 

By Assumptions A4 and A2, 

lim sup sup |C/(y) —C/(x) —(VC/(x),y —x)| = 0, 6 > 0. 

>°° x . u(^)£[ Un _ btUn b ] yeS(x;r n ) 

Hence, by Lemma 9 with () = 1, (48) and (A4b) of Lemma 8 , we have uni¬ 
formly for all l and u € [u n ^b, u n ^}, 

F(S(x(£,uy,r n )) 

~ V2ne- u C 3/2 ^,u)r^ 2 e^’ u > n 

(49) 

~ v / 27re _ “^ _3 / 2 (^,logn)ry 2 e^^ 0 ’ logn ^ rn e^^’ logTl ^ _ ^^°’ logri ^ rri 
= e~ u Xn{£), 

where 

Xn (£)= C 2 T - 1 ^- 3 / 2 (£,log n ) r -V 2 e ^ r? - c i e [HL log n)-?(^, log n)]r„_ 

Now pick a large u a so that the bounds in Assumption A3 apply for u> u Q . 
By the nonintersection of level curves, Assumption A2 and (49), for all large 
n we have, for some constant b \ . 62 > 0 , 

inf F(S(x;r n )) 

x: ti D <(J(x)<u„ _i 

> ini F(S(x(£,u n _ b );r n )) 

> b\ inf e _ “ n ’- i '^" 3 / 2 (£, log n)rj/ 2 e Vn r/“ Cl 

> 62 ^ _ 1 (logn) 6 ^ _ 2 (£ 0 ,logn)e r?n r/y 2_Cl . 

It then follows from Assumption A3 that 

inf F(5(x;r n )) > fe 3 n _ 1 (logn ) 64 

x: u 0 <U (x)<tt n _i, 

for some constants 63 , 64 , where 64 can be picked large provided that 6 is 
large. Now for x for which t/(x) < u 0 , it is clear that F(S(x-,r n )) can be 
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bounded below by the same bound, since the density there is bounded away 
from 0. Combining the two cases we conclude that if b is large enough, we 
can pick 63 and 64 such that 


(50) 


n 


e -nF(S(x;r„)) di 7( x ) < ne ~H logn) 6 4 q 


/x: U(x)<u n -b 

Next, by (20) and Assumption A3, there exists a constant 65 such that 

,A(logn) | x ( 1 >°)(£,u)| 


n 


/x: U(x)>i 


dF(x) = n 


U>Un : bJ<i= 0 M^1 U ) 

-u MM 


■ didu 


(51) 


fl I g 

Ju>u nt b 
<b^n f e~ u u 2p du 

Ju>u„ h 


■ du 


J U^> 1 ln t b 
~ u n,b~, 2 P 


b^ne Un ’ b u, 


n,b 


0 . 


Hence 

(52) 


n 


/x: C/(x)>u„ i6 


e -nF(S(x;m)) dF ( x ) Q 


for a sufficiently large b. Now, (50) and (52) imply (45). 

It follows from (24), (49), and (A4b) and (A4c) of Lemma 8 that 

n J /(H(x) G K-^^l.ffx) € [4 1 i,4,2])e- nF(S(x;r)) dF(x) 


C u n,b 


fin, 2 


~ n 


e -n(l+o(l))e “Xn(<) e -u. 


1 


didu. 


Ju=u„ t -b J £=tn,i £(7,logn) 

Make a change of variables in the above integral with 

v = u - logn - log Xn {£) • 

By Assumptions A2, A3 and the assumption that rj n = log 2 (n) in Assump¬ 
tion A5, we conclude that log Xn(£) = 0(log 2 re). As a result, it is possible to 
choose large enough b such that v n _b —> —00 and v n ^ —> 00 , where v U) ±b cor¬ 
responds to u n> ±b with respect to the above variable change. Since 


—e u —v 


e v dv = 1, 


we have 


n j 7(C/(x) G [u n - b ,u n ^ b \,£(x.) G [4,1,4, 2 ])e " F(S(x;r,) dF(x) 
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f v n,b 


e -(l+o(l))e-» e -« 


'V — J 0, — i^n,\ 


~ c 0 Vm 1//2 e 7,71 n ci 


, r - 1 T- (0 - t ?n r) Cl-l/2 

c 2 r e Vra 


C(^,logn)xn(^) 


d£dv 


fin,2 


n=i 


e -[mosn)-W°,logn)\r n £l/2^ logn j di 


n, 1 


x / ’ e _ K(Li°g™)-£(L>,iogn)]r„, [£(^ log n )£(£ 0 , log n)] 1 / 2 d£ 


it'L ! e-K^ logn )- f ^ 0 ’ logn )] r7l C 1/2 (^,logn) d£ 

jA(logn) e -[e(£,logn)-e(£o,logn)]r n ^l/2^ ) l ogn )^ 

by Assumption A5. This proves (46) for k = 0 and d n = 1. □ 


8. Proofs of Theorems 4 and 5. We continue to use below the notation 
of u n% ±i defined in (40). 


Proof of Theorem 4. First consider the case N n ~ Poisson(n). We 
start by defining the sets A n \ ,..., A n ^ n in Theorem 1. For convenience write 


In(£) = 


t /to e_K {t " ,log n) (<? ° ,log n)]rn ? 1/2 (*, log n) dt 

ft=o° Sn ’ > e -K(h!og n)-Z(e o ,logn)]r n £l/2( t ^ ogn ^ fa 


i G [0, A(logra)). 


Thus, by (46) of Lemma 10 with k = 0 and 5 n = 1, for some b > 0 we have 
uniformly for 0 < 4i,i < 4i ,2 < A(logn), 

(53) /i n ({x(( 1l).f £ \£n,h £n,2)i U £ [Wn,-6i ^n,fc] }dn) ~ In(£n,2) In (In, 1) 

where /it is defined in (1). In the following we will continue to work with 
this choice of b. By Assumptions A5(a) and A5(b), 


£ n ■= sup [I n (£ + r n ) - In 
l 

Let k n be a sequence of integers such that 


0. 


oo and kn £n 


let jo = 0 and, for i = 1,... ,k n , let j, be the largest positive integer j such 
that I n (Jr n ) < ir/k n , where jk n is simply the largest positive integer j such 
that jr n < A(logn). Note that since £ n = o(l/fc n ), the differences between 
successive jjs tend to oo uniformly so that ji-i < ji — 3 for all large n. 
Dehne 


A n ,i = {x(^,u) :£ £ [ji-ir n , (ji - 3)r n ),u E [u n _ 6 ,u n)6 ]}, 1 < i < fen, 
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and 

An,k n +1 — A n \. 

Also let 

A n ,i {x(£, 'll) . £ G [(ji 3)r n , ji^n) > ^ ^ [^n,—6, ^n,b]}, 1 — * ^ kn 1, 

and 

A n ,k„ = {*{£,u):£e [(j kn - 3)r n , A(logn)),u G [it„ _ 6 , ix n , 6 ]}. 

By the choice of the sets, it follows from (53) and the definition of e n that 
(54) 1 /k n — 4 £ n < ^ (A ni j, r n ) < l/k n s n , 1 ^ i ^ k n , 


(55) max /xW(A n j,r n ) < 3e„ and //^ (A n fcn , r„) < 3e„ + l//c n , 

l<z</c n —1 


and hence that 
(56) 


l^n (-4n,i)^'n) , 

max —- : - < 


3c fl 


1 ^(A^ijTVj) V&n 


0. 


We now proceed to verify the conditions (a)-(e) of Theorem 1. 
By convexity of the level curves, 


Cn,i ■= inf(|x — y| : x G 4 fl ,i,yG A nji+1 ) = |x((jj - 3 )r n ,u n - b ) - x.(jir n ,u n - b )|. 


By (20) and the condition (A4c) of Lemma 8, the length a n ^ of the arc 
that connects x((j) - 3)r n , u n _ 6 ) with x(jir n ,u n - b ) on {x : C/(x) = u n _&} 
is asymptotically 3 r n . By the same token, A (u n - b ) ~ A(logn). Let 9 n ^ be 

Vf/(x(Li— 3)r„,u„,_i,)) j VU(x(jir„,u ni ^ b )) 

VCt(x((ji-3)r„- allCt 


the angle between 


Since r„ = 


-i,-b))l |Vt7’(x(j' j r n ,t4 n| _ I ,)) 

o(A(logn)) by Assumptions A5(a) and A5(b), Assumption Al then guar¬ 
antees that rriaxj 9 n ^ —> 0. Consider the triangle with base equal to the line 
that connects x((j) — 3 )r n ,u n - b ) and x(jir n ,u n - b ), and with sides deter¬ 
mined by the tangents at these two points on the curve ?7(x) = u n - b ; let 
the two base angles be 9\ and 02, say, and the corresponding two sides have 
lengths si, S 2 , respectively. Figure 1 depicts what is described here. 

Then 


(si + S2) cos 9n t i _ (si + S 2 ) COS(01 + 0 2 ) ^ «1 COS 01 + S 2 COS 02 _ 1 

\ L, 


and so 

CLn i Si -{- So 1 „ , 

—- <-<---> 1 uniformly as n —> 00 . 

Cn,i C-n,i COS U n ,i 


Consequently, minj c n j > 2 r n for large n. Hence we have shown that adja¬ 
cent A Ut i s are at least 2 r n apart for large n. That nonadjacent A n /s are at 
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least 2 r n apart for large n can also be established by the convexity of the 
level curves. Hence the condition (a) of Theorem 1 is proved. 

It follows from (47) of Lemma 10 with 5 n = 1 and k = 0 that the condition 
(b) of Theorem 1 holds. To prove (c) of Theorem 1, note that it suffices to 
show that 

kn 

/j,^{-x(£,u):ug[u n -b,u nt b]}-> 0 and ^ ii^\A n ^r n ) -> 0. 

3 =1 

The first convergence follows from (45) of Lemma 10 with 5 n = 1 and k = 0, 
while the second convergence follows from (55) and (56), in conjunction with 
the condition (b) already proved above. Condition (d) of Theorem 1 holds 
by (54). 

Finally we prove (e) of Theorem 1. By the simple inequality 
P{A U B)> ( P(A ) + P(B))/2 


we have 

/4j 2) (A,r n ) < (nj A e- nF ^ r ^/ 2 dF(^y 

so that 


H { n\A n ^r n ) < (rif An .e ^fe))/ 2^ (x)) 2 


/^n (-^4-71,2? ^*72) 


n /a e nF ( s ( x ’ rn ')') dF(x) 
By (46) of Lemma 10 with 5 n = 1/2,1 and k = 0, and (54), 


( u 5a 


?—nF(S(x\r n ))/2 


dF(x)) 5 


n f A . e -«E(S(x;r„)) dF ( x ) 


(A n>i ,r n ) 0 


uniformly in i, where t is the constant specihed in the theorem in defining r n . 
Hence (e) of Theorem 1 is proved. Replacing r by e~ x completes the proof 
for the case N n ~ Poisson(n). 



Fig. 1. 
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Next, 

P(M N NG(7V n ) < r n) 

< -P(M NNG ( n ) < r n ) + P(M NNG ( JVri ) <r n < M NNG ( n )) 

< P(M NNG (jv n ) < r n ) + P(M NNG(n) <r n < M NNG (tv„)) 

+ P(-^NNG(JV n ) < r n < ^NNG(n))- 

Using the inequality 1 — x < e -1 , x < 1, it follows from (47) of Lemma 10 
with k = 0, (5 n = 1 and k = 1, 5 n = 5, where <5 6 (0,1), that (12) and (13) 
hold. Taking limits throughout the preceding inequalities, it follows from 
Theorem 2 and the first part of the proof with N n ~ Poisson(n) that 

Ji^ P ( M NNGW - r ™) = J^ P ( M NNG(iV„) < r n ) = e~ T - 

Applying the above argument again gives 

i( lm p ( M N NG (J v „) < r n ) = lur^P{M NNG ( n) < r n ) = e~ T 

for any general N n satisfying N n /n-^l. Replacing r by e~ x completes the 
proof. 

□ 


Proof of Theorem 5. It follows from Theorems 3 and 4 that 
P( M MST(N n ) < r n ) = n lim ) P(MN NG (j Vn ) < r n ) = e~ T 
provided we show (16). To do that, it suffices to show that 

(57) n J F Snn (S(x;r n ))I( |x| > r n /2) dF(x) ->■ r, 
and 

(58) n J F <5?in (S'(x;r n ) n 5(0; |x|))/(|x| > r n /2)dF(x) -> r 

for any sequence 5 n —> 1. For any fixed e > 1, it follows from Assumption A6 
that for all large n, 


(59) 


2|x(^, Un,—b) I — ^n,— b) I T |x(^, U n — e ij) \ 

P x(^, U n —i)) x(^, U n ^— e i)) |. 

By (18) and (A4b) of Lemma 8, 

VU(x(f,r)) 


x 


'I'U'rij—b) ^-{^ 1 'U , n,—eb) \ 


Un.-rf, |VU(x(^,u))|2 


dv 


^ 2 (£,log n) 


C u n, — b 


S7U(x.(£,v))dv 


u n, — eb 








30 


T. HSING AND H. ROOTZEN 


Now write 
ru n ,-b 

^ u n, — eb 


Since 


VU(x(e,v))dv = 


f‘ u n, — b 


^n, — eb 

P^n. — b 


+ 


VC/(x(.C,logn)) dv 

[VC/(x(£, v)) — VC/(x(C, log n))} dv. 


' u n, — eb 


C u n, — b 


VC/(x(C,logn)) du 


' u n, — eb 

if we can show 


= C(C,logn)(e- l)61og 2 n, 


(60) 


|VC/(x(l, u)) — VC/(x(l, logn))| 
^(^,logn) 


then it follows at once that 


(61) 


l x (C, u n _b) -x(£,u n _ eb )I ~ 


(e- l)61og 2 n 


£ (A log n ) 

We now show (60). By the mean value theorem, 

|VC/(x(£, u)) - VC/(x(£, logn))| 


sup 

V€z\u n , — e b , — b\ 


£(C,logn) 


(62) 


< 


e&log 2 n 
£{£,\ogn) 


sup 

VG ['CC’n.j — ebj'^n, — b] 


|-C/( 1 ’°)(x(C,u)) 


+ sup 

V£[ u n, — eb^n. — b] 


d_ 

dv 


C/^x^-u)) 


As in the proof of (A4b) in Lemma 8, by (41) we have 

ov ov Ov 


< 


|VC/(x(C,u))| 
which, by (A4a) and (A4b), gives 


sup 

V£['U'n, — eb:'U'n, — b\ 


^-U^°\x(£,v)) 


(|C/( 2 ’°)(x(C,u))[ + |C/( 1 ’ 1 )(x(C,u))|), 
/ £(C,logn) 


= o 


V (log 2 n ) 2 / 


The same conclusion can be reached for sup^g^ _ ebtUn _ 6 ] |^C/( 0,1 ^(x(C, u))|. 
Thus, (60) follows from (62), and (61) is proved. Since r n < r\ n l^ n for large 
n where r\ n = 0(log 2 n), we conclude by (59), (60) and Assumption A2 that 

liminf inf |x(C, u n _ b)\/ r n > cb 

n —>oo £ ’ 






















EXTREMES ON TREES 


31 


for some finite constant c independent of b. Hence we can choose b sufficiently 
large to ensure that infy | x.(£,u n -b)\/r n > 1. By this and (45) of Lemma 10 
it is the case that for a large enough b, 

n j F 5nn (S(x-,r n ))I( |x| <r n /2)dF(x) 

<n [ F 5 ™ n (5(x; r n )) dF(x) -> 0 

so that (57) follows from Lemma 10. 

We next consider the proof of (58). We will do so by proving that for a 
6>0, sufficiently large, 

(63) nf F Snn (S(x.} r n ) n 5(0; |x|))/(|x| > r n /2) dF(x) —► 0, 

7x: U(x)>u ni b 

(64) nf F Snn (S(x\ r n ) C 5(0; |x|))/(|x| > r n /2) dF(x) —> 0 

7x: U(x)<u n - b 

and 

n f F 5 " n (S(x;r n ) n S(0; |x|))/(|x[ >r n /2)dF(x) -> r. 

4x: u n -b<U(-x)<u n fi 

(65) 

Clearly (63) follows from (51) in the proof of Lemma 10. To deal with (64), 
first write 

n f F Snn (S(x.; r n ) n 5(0; |x|))/(|x| > r n /2) dF(x) 

4x: U{x)<u n - b 

= n f F <5 " n (5(x;r n )nS , (0;|x|))/(|x| >r n /2)dF{x) 

7x: C/(x)<u n> _ 6 /2 

+ n f F' 5n?l (5(x;r ri )n5(0;|x|))/(|x| >r n /2)dF(x). 

JU n ^ — b/^ — U (x)<CWtj, ? — 5 

We will show that both terms on the right-hand side tend to 0. Since 
/(x) = e ~ u ^, by Assumptions A2, A3 and the mean value theorem, there 
exists some b\ £ (0,oo) such that 

inf f( y) > e _ '“«,-b/ 2_6l €(^°’ 1 og n ) r '« > e - u n,-f»/ 2 - & i»7n. 

y6S(x;r„),[/(x)<«„_),/2 

Also observe that for x such that |x| > r n /2, 

area(5(x; r n ) n 5(0; |x|)) > ^vrr^ 
for some b 2 £ (0,1). Hence, for all large n, 

inf F(5(x;r n ) n 5(0; |x|)) > b^r 2 n e~^l 2 - b ^ 

x: |x|>r n /2,t/(x)<u Ili _i,/2 
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and hence 


( 66 ) 


n / F <5nTl (5(x;r ri ) PI 5(0; |x|))/(|x| > r n /2)dF(x) 

2x: C/(x)<M ni _ b /2 

< nexp{—5 n nb2Trr^ l e~ Uri ’- b ^‘ 2 ~ birin } —> 0 . 


By Assumption A6, there exists some constant ( G (—1,0) such that for all 
x with U(x) > u n _b/2 , 

5(x;r n ) n|y:^y-x, ^ < C^nj C 5(x; r n ) n 5(0; |x|). 

Hence 


.inf , F(5(x;r„)n5(0;|x|)) 

XI 'U'n, — 


(67) > inf 

xi i£ n) _jj/2^t/(x)<Cw n? _5 


F^5(x;r n )n|y:^y-x, 


V[/(x) 

IvcM 


< 


> infF(5(x(£,u n _b);r n ) 


€ V 


n|y:^y-x(^,u n _ 6 ), 


VH (x(^, u n -b)) 


< C’G 


|Vt/(x(^ n ,_ 6 ))| 

It follows from Lemma 9 that uniformly for u G u ni b] and i, 

WU(x{£,u)) 


( 68 ) 


F^S(x(£,u)-,r n ) n jx: ^x - x(£,u), 


< C,r r 


\VU(x(£,u))\ 

F(5(x(£, log n);r n )). 

By (67) and (68), the same proof that leads to (50) in Lemma 10 now proves 


(69) 



F' S ^(5(x;r ri )n5(0;|x|)) 

^n,— b/^ — U (x)<C U n , — b 


x I(r n /2 < x < u n - b ) dF(x) -> 0. 

Hence (64) follows from (66) and (69). Making use of (68), the proof of (65) 
mirrors that of (46) of Lemma 10 and is omitted. □ 
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